SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Peter DORMER 

(ii) TITLE OF INVENTION: PROTEIN WITH DIFFERENTIATION-INDUCING 
ACTIVITY FOR FRIEND'S ERYTHROLEUKEMIA CELL LINES 

(iii) NUMBER OF SEQUENCES : 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LOWE, PRICE, LEBLANC & BECKER 

(B) STREET: 99 Canal Center Plaza, Suite 300 

(C) CITY: Alexandria 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22314 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Presta, Frank P. 

(B) REGISTRATION NUMBER: 19,828 

(C) REFERENCE/DOCKET NUMBER: 3428-005 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 684-1111 

(B) TELEFAX: (703) 684-1124 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCGACCGTGC GGACTTAAGA TGGAGGCACT TCCTGTCTGC GGCGGGAAGA GAAGGCTCGG 60 

TCGGAGCCGG GAATGCTGGG ACTTGTAGTG CGTAGTCAAT GGTTCTCTAT GGGCTTTCAG 120 

AGTGAGTGGC GGGAAGGCGG CCCCGAGGCA TGCTGGGAGT TGTAGTCCTG CCGTCGTCAA 180 

TGGTTCTCTA TGGGCTTTCA GAGTGAGTGG CGGGAAGGCG GCCCCGAGGC ATGCTGGGAG 240 

TTGTAGTCCT GCCATAGTCA ATGGTTCTCT ATGGGCTTTC AGACTGAGTG GCGGGAAGGC 300 

GGCCCCGAGG CATGCTGGGA GTTGCAGCGC CATGTTTTAA AGCACGCGTT TCTCTGTATA 360 

GACCTGGCTG TGGATTTTTC GCTAATTCTT TTTTTTAGCT TTATTTTTAA TTTTTACTTT 420 

TTCACACAGG ATTTCTCTTT ATAGCCTTGG CTACCGTTTT TTCCCTAATT ATTCTCCTTT 480 

TCATTTTGGT TTATTTTTTT TTAATTTTGG TTTTTTTAAG ACAGGGTTTC TCTGTATAGA 540 

CCTGGCTGTG GATTTCTCAC TAATTATTTT TTTTAGCTTT ATTTTTAATT TTTACTTTTT 600 

CACACAGGAT TTCTCTTTAT AGCCTTGGCT ACCGTTTTTT CCGTAATTAT TCTTATTTTC 660 

ATTTTGGTTT ATTTTTTAAT TTTAATTTTT GATTTTGGAG ACAGGGTTTC TCTTTTAGCC 720 

GCAGCTATGG TTTCTGCCCT AATTATTCTT GTCCTTATTT GTAATTTAAT TCTTAATTTA 780 

ATTTAATTTA TAATTTTGTT GTAAGTTTTT CTGTGGGCGT GAATGGAAAG TCTAACCCGT 840 

GTTTCTCTGT TCAGCGTCCG CCGGTCACGG CCGCCGCCCC CAGCGACGTC ACCCACACGC 900 

GCAGAAGCGG ACGCCGCGGT CAAGATGTCT CTGCCATGCC CACGGGACGC ACGGACGCAC 960 

GGACGGACGG ACGGACTCCA CAAGGTAGGA AGCCTGCGCC GACCGCACCG CCGCACCCAC 1020 

CACAGCACAC AGGACACACG CGGGCCCCGC GCCCGCCCAG GCACACGCGG CACACACGGC 1080 

ACACACGGCA GGCAGGCCAG GCACACGCAT CCGCAGGACC CGCCGCACCC GCCACGCAGA 1140 

CACGGACGAG CCGCCGCGGT CAAGATGTTC ACCCGCCGCG GTCAAGATGT ATGTGCCACC 1200 

GACCCTCGCC CCGCTGGACG GACGGACGGA CGCACGCACG CCGTCAGCGT CCACCGGTCA 1260 

CTGCCGCCGC CCACAGTGAT GTCACCCACG AAAGCACACA CGTAGAAGCG GACGCCGTGG 1320 

TCAAGATGTC TCTGCCATCC CCACAGGACG GACGGACGGA CTCCACAAGG TGCGCGTGTC 1380 

GCCGAGGCCG CCAGGACGGA GCGATTCTCA CGGAGGAAGG AGCACGCCAA CAGGGCCTGA 1440 

CTGCGTACAG ACATGTCCCC CTCAATAAAA TTGCAGTTGA AATGGAAAAA AAAAA 1495 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 155..688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGCGCCCGCC CGGGATCCCC AGCTGCCGCC GCGCCCGCCC GCCCGCCCGG GGCCCCCGCT 60 

GCAGAACCGT GACCGTCCGC CGGTCACGGC CGCCGCCCCC AGCGACGTCA CCCACACGCG 120 

CAGAAGCGGA CGCCGCGGTC AAGATGTCTC TGCC ATG CCC ACG GGA CGC ACG 172 
Met Pro Thr Gly Arg Thr 
1 5 

GAC GCA CGG ACG GAC GGA CTG ACT CCA CAA GGT AGG AAG CCT GCG CCG 220 
Asp Ala Arg Thr Asp Gly Leu Thr Pro Gln Gly Arg Lys Pro Ala Pro 
10 15 20 

ACC GCA CCG CCG CAC CCA CCA CAG CAC ACA GGA CAC ACG CGG GCC CCG 268 
Thr Ala Pro Pro His Pro Pro Gln His Thr Gly His Thr Arg Ala Pro 
25 30 35 

CGC CCG CCC AGG CAC ACG CGG CAC ACA CGG CAC ACA CGG CAG GCA GGC 316 
Arg Pro Pro Arg His Thr Arg His Thr Arg His Thr Arg Gln Ala Gly 
40 45 50 

CAG GCA CAC GCA TCC GCA GGA CCC GCC GCA CCC GCC ACG CAG ACA CGG 364 
Gln Ala His Ala Ser Ala Gly Pro Ala Ala Pro Ala Thr Gln Thr Arg 
55 60 65 70 

ACG AGC CGC CGC GGT CAA GAT GTT CAC CCG CCG CGG TCA AGA TGT ATG 412 
Thr Ser Arg Arg Gly Gln Asp Val His Pro Pro Arg Ser Arg Cys Met 
75 80 85 

TGC CAC CGA CCC TCG CCC CGC TGG ACG GAC GGA CGG ACG CGC GCA CGC 460 
Cys His Arg Pro Ser Pro Arg Trp Thr Asp Gly Arg Thr Arg Ala Arg 
90 95 100 

CGT CAG CGT CCA CCG GTC ACT GCC GCC GCC CAC AGT GAC GTC ACC CAC 508 
Arg Gln Arg Pro Pro Val Thr Ala Ala Ala His Ser Asp Val Thr His 
105 110 115 

GAA AGC ACA CAC GTA GAA GCG GAC GCC GTG GTC AAG ATG TCT CTG CCA 556 
Glu Ser Thr His Val Glu Ala Asp Ala Val Val Lys Met Ser Leu Pro 
120 125 130 

TCC CCA CAG GAC GGA CGG ACG GAC TCC ACA AGG TGC GCG TGT CGC CGA 604 
Ser Pro Gln Asp Gly Arg Thr Asp Ser Thr Arg Cys Ala Cys Arg Arg 
135 140 145 150 

GGC CGC CAG GAT GGA GCG ATT CTC ACG GAG GAA GGA GCA CGC CAA CAG 652 
Gly Arg Gln Asp Gly Ala Ile Leu Thr Glu Glu Gly Ala Arg Gln Gln 
155 160 165 

GGC CTG ACT GCG TAC AGA AAT GCC CCC CCT CAA TAA AATTGCAGTT 698 
Gly Leu Thr Ala Tyr Arg Asn Ala Pro Pro Gln * 
170 175 

GAAATGGAAA AAAAAAA 715 
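
The CDS location given for SEQ ID NO: 2 can be cross-checked arithmetically against the protein length stated for SEQ ID NO: 3. The following Python sketch is illustrative only and is not part of the filed listing. The function name `cds_residue_count` is hypothetical, and it assumes that the location is 1-based, inclusive, and includes the terminal stop codon (shown as `*` in the listing).

```python
def cds_residue_count(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acid residues encoded by a CDS location.

    Assumes a 1-based, inclusive location, as in the FEATURE entries.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    codons = length // 3
    # Drop one codon when the location includes the stop codon.
    return codons - 1 if includes_stop else codons


# CDS location stated for SEQ ID NO: 2 (155..688).
print(cds_residue_count(155, 688))
```

Under these assumptions, the location 155..688 spans 534 nucleotides, that is 178 codons including the stop codon, which agrees with the 177 amino acids stated for SEQ ID NO: 3.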



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Pro Thr Gly Arg Thr Asp Ala Arg Thr Asp Gly Leu Thr Pro Gln 
1 5 10 15 

Gly Arg Lys Pro Ala Pro Thr Ala Pro Pro His Pro Pro Gln His Thr 
20 25 30 

Gly His Thr Arg Ala Pro Arg Pro Pro Arg His Thr Arg His Thr Arg 
35 40 45 

His Thr Arg Gln Ala Gly Gln Ala His Ala Ser Ala Gly Pro Ala Ala 
50 55 60 

Pro Ala Thr Gln Thr Arg Thr Ser Arg Arg Gly Gln Asp Val His Pro 
65 70 75 80 

Pro Arg Ser Arg Cys Met Cys His Arg Pro Ser Pro Arg Trp Thr Asp 
85 90 95 

Gly Arg Thr Arg Ala Arg Arg Gln Arg Pro Pro Val Thr Ala Ala Ala 
100 105 110 

His Ser Asp Val Thr His Glu Ser Thr His Val Glu Ala Asp Ala Val 
115 120 125 

Val Lys Met Ser Leu Pro Ser Pro Gln Asp Gly Arg Thr Asp Ser Thr 
130 135 140 

Arg Cys Ala Cys Arg Arg Gly Arg Gln Asp Gly Ala Ile Leu Thr Glu 
145 150 155 160 

Glu Gly Ala Arg Gln Gln Gly Leu Thr Ala Tyr Arg Asn Ala Pro Pro 
165 170 175 

Gln 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..636 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GGG CTG CAG AAC CGT GAC CGT CCG CCG GTC ACG GCC GCC GCC CCC 48 
Met Gly Leu Gln Asn Arg Asp Arg Pro Pro Val Thr Ala Ala Ala Pro 
180 185 190 

AGC GAC GTC ACC CAC ACG CGC AGA AGC GGA CGC CGC GGT CAA GAT GTC 96 
Ser Asp Val Thr His Thr Arg Arg Ser Gly Arg Arg Gly Gln Asp Val 
195 200 205 210 

TCT GCC ATG CCC ACG GGA CGC ACG GAC GCA CGG ACG GAC GGA CTG ACT 144 
Ser Ala Met Pro Thr Gly Arg Thr Asp Ala Arg Thr Asp Gly Leu Thr 
215 220 225 

CCA CAA GGT AGG AAG CCT GCG CCG ACC GCA CCG CCG CAC CCA CCA CAG 192 
Pro Gln Gly Arg Lys Pro Ala Pro Thr Ala Pro Pro His Pro Pro Gln 
230 235 240 

CAC ACA GGA CAC ACG CGG GCC CCG CGC CCG CCC AGG CAC ACG CGG CAC 240 
His Thr Gly His Thr Arg Ala Pro Arg Pro Pro Arg His Thr Arg His 
245 250 255 

ACA CGG CAC ACA CGG CAG GCA GGC CAG GCA CAC GCA TCC GCA GGA CCC 288 
Thr Arg His Thr Arg Gln Ala Gly Gln Ala His Ala Ser Ala Gly Pro 
260 265 270 

GCC GCA CCC GCC ACG CAG ACA CGG ACG AGC CGC CGC GGT CAA GAT GTT 336 
Ala Ala Pro Ala Thr Gln Thr Arg Thr Ser Arg Arg Gly Gln Asp Val 
275 280 285 290 

CAC CCG CCG CGG TCA AGA TGT ATG TGC CAC CGA CCC TCG CCC CGC TGG 384 
His Pro Pro Arg Ser Arg Cys Met Cys His Arg Pro Ser Pro Arg Trp 
295 300 305 

ACG GAC GGA CGG ACG CGC GCA CGC CGT CAG CGT CCA CCG GTC ACT GCC 432 
Thr Asp Gly Arg Thr Arg Ala Arg Arg Gln Arg Pro Pro Val Thr Ala 
310 315 320 

GCC GCC CAC AGT GAC GTC ACC CAC GAA AGC ACA CAC GTA GAA GCG GAC 480 
Ala Ala His Ser Asp Val Thr His Glu Ser Thr His Val Glu Ala Asp 
325 330 335 

GCC GTG GTC AAG ATG TCT CTG CCA TCC CCA CAG GAC GGA CGG ACG GAC 528 
Ala Val Val Lys Met Ser Leu Pro Ser Pro Gln Asp Gly Arg Thr Asp 
340 345 350 

TCC ACA AGG TGC GCG TGT CGC CGA GGC CGC CAG GAT GGA GCG ATT CTC 576 
Ser Thr Arg Cys Ala Cys Arg Arg Gly Arg Gln Asp Gly Ala Ile Leu 
355 360 365 370 

ACG GAG GAA GGA GCA CGC CAA CAG GGC CTG ACT GCG TAC AGA AAT GCC 624 
Thr Glu Glu Gly Ala Arg Gln Gln Gly Leu Thr Ala Tyr Arg Asn Ala 
375 380 385 

CCC CCT CAA TAA 636 
Pro Pro Gln * 
390 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gly Leu Gln Asn Arg Asp Arg Pro Pro Val Thr Ala Ala Ala Pro 
1 5 10 15 

Ser Asp Val Thr His Thr Arg Arg Ser Gly Arg Arg Gly Gln Asp Val 
20 25 30 

Ser Ala Met Pro Thr Gly Arg Thr Asp Ala Arg Thr Asp Gly Leu Thr 
35 40 45 

Pro Gln Gly Arg Lys Pro Ala Pro Thr Ala Pro Pro His Pro Pro Gln 
50 55 60 

His Thr Gly His Thr Arg Ala Pro Arg Pro Pro Arg His Thr Arg His 
65 70 75 80 

Thr Arg His Thr Arg Gln Ala Gly Gln Ala His Ala Ser Ala Gly Pro 
85 90 95 

Ala Ala Pro Ala Thr Gln Thr Arg Thr Ser Arg Arg Gly Gln Asp Val 
100 105 110 

His Pro Pro Arg Ser Arg Cys Met Cys His Arg Pro Ser Pro Arg Trp 
115 120 125 

Thr Asp Gly Arg Thr Arg Ala Arg Arg Gln Arg Pro Pro Val Thr Ala 
130 135 140 

Ala Ala His Ser Asp Val Thr His Glu Ser Thr His Val Glu Ala Asp 
145 150 155 160 

Ala Val Val Lys Met Ser Leu Pro Ser Pro Gln Asp Gly Arg Thr Asp 
165 170 175 

Ser Thr Arg Cys Ala Cys Arg Arg Gly Arg Gln Asp Gly Ala Ile Leu 
180 185 190 

Thr Glu Glu Gly Ala Arg Gln Gln Gly Leu Thr Ala Tyr Arg Asn Ala 
195 200 205 

Pro Pro Gln 
210 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGTCCGCCGG TCACGGCCGC CGCCCCCAGC GACGTCACCC AC 42 
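
Each sequence row in the listing groups bases in tens and ends with a running position counter, so the declared LENGTH can be checked by counting nucleotide letters only. The following Python sketch is illustrative and not part of the filed listing. The helper name `count_bases` is hypothetical, and it assumes the rows contain only the letters A, C, G and T plus spaces and digits.

```python
def count_bases(rows):
    """Count nucleotide letters in listing rows.

    Spaces between groups and trailing position counters are ignored
    because only the letters A, C, G and T are counted.
    """
    return sum(ch in "ACGT" for row in rows for ch in row.upper())


# Row copied from SEQ ID NO: 6, including its trailing position counter.
seq_id_6_rows = ["CGTCCGCCGG TCACGGCCGC CGCCCCCAGC GACGTCACCC AC 42"]
declared_length = 42  # (A) LENGTH: 42 base pairs

print(count_bases(seq_id_6_rows) == declared_length)
```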

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGAAGCGGAC GCCGCGGTCA AGATGTCTCT GCCATGCCCA CGGGACGCAC GGACG 55 
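
A short sequence can be located within a longer one by plain substring search once the group spacing is removed. The listing itself does not state how the short sequences relate to the longer ones, so the following Python sketch is only an illustrative check. The function name `locate` is hypothetical, and the reference window is positions 121 to 220 of SEQ ID NO: 2 copied from the listing without its position counters.

```python
def locate(fragment: str, reference: str, first_position: int = 1) -> int:
    """Return the position of fragment within reference, or -1 if absent.

    Whitespace is ignored in both strings. first_position is the sequence
    position of the first base of the reference window.
    """
    def clean(text: str) -> str:
        return "".join(text.split()).upper()

    index = clean(reference).find(clean(fragment))
    return -1 if index < 0 else index + first_position


seq_id_7 = "AGAAGCGGAC GCCGCGGTCA AGATGTCTCT GCCATGCCCA CGGGACGCAC GGACG"

# Positions 121 to 220 of SEQ ID NO: 2 (an assumption of this sketch).
seq_id_2_window = (
    "CAGAAGCGGA CGCCGCGGTC AAGATGTCTC TGCC ATG CCC ACG GGA CGC ACG "
    "GAC GCA CGG ACG GAC GGA CTG ACT CCA CAA GGT AGG AAG CCT GCG CCG"
)

print(locate(seq_id_7, seq_id_2_window, first_position=121))
```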
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TAGTCCTGCC GTCGTCAATG GTTCTCTATG GGCTTTCAGA GTGAGTGGCG GGAAGGCGGC 60 

CCCGAGGCAT GCTGGGAGTT G 81 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTTTCTCTGT ATAGACCTGG CTGTGGATTT TTCGCTAATT CTTTTTTTTA GCTTTATTTT 60 

TAATTTTTAC TTTTTCACAC AGGATTTCTC TTTATAGCCT TGGCTACCGT TTTTTCCCTA 120 

ATTATTCTCC TTTTCATTTT GGTTTATTTT TTTTTAATTT TGGTTTTTTT AAGACAGG 178 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGGAAGCCTG CGCCGACCGC ACCGCCGCAC CCACCACAGC ACACAGGACA CACGCGGGCC 60 

CCGCGCCCGC CCAGGCACAC GCGGCACACA CGGCACACAC GGCAGGCAGG CCAGGCACAC 120 

GCATCCGCAG GACCCGCCGC ACCCGCCACG CAGACACGGA CGAGCCGCCG CGGTCAAGAT 180 

GTTCACCCGC CGCGGTCAAG ATGTATGTGC CACCGACCCT CGCCCCGCTG GACGGACGGA 240 

CGGACGCACG CACGCCGTCA GCGTCCACCG GTCACTGCCG CCGCCCACAG TGATGTCACC 300 

CACGAAAGCA CACACGTAGA AGCGGACGCC GTGGTCAAGA TGTCTCTGCC ATCCCCACAG 360 

GACGGACGGA CGGACTCCAC AAGGT 385 



